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Abstract — In this paper, we analyze the frequency-hopping orthogonal frequency-division multiplexing 

*~J ! (OFDM) system known as Multiband OFDM for high-rate wireless personal area networks (WPANs) based 

i on ultra-wideband (UWB) transmission. Besides considering the standard, we also propose and study system 

CN ' performance enhancements through the application of Turbo and Repeat-Accumulate (RA) codes, as well 

> '. 

, as OFDM bit-loading. Our methodology consists of (a) a study of the channel model developed under IEEE 

G\ . 

^ ', 802.15 for UWB from a frequency-domain perspective suited for OFDM transmission, (b) development and 

^ ; quantification of appropriate information-theoretic performance measures, (c) comparison of these measures 

O ' 

O 



with simulation results for the Multiband OFDM standard proposal as well as our proposed extensions, and 
(d) the consideration of the influence of practical, imperfect channel estimation on the performance. We 
find that the current Multiband OFDM standard sufficiently exploits the frequency selectivity of the UWB 



channel, and that the system performs in the vicinity of the channel cutoff rate. Turbo codes and a 

_ _ i 

reduced-complexity clustered bit-loading algorithm improve the system power efficiency by over 6 dB at a 
data rate of 480 Mbps. 

Index terms: Ultra wideband, Multiband OFDM, diversity, bit loading, information-theoretic measures 
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1 Introduction 

Ultra-wideband (UWB) radio has recently been popularized as a technology for short-range, high data 
rate communication and locationing applications (cf. e.g. [T]). The IEEE 802.15 standardization group, 
responsible for wireless personal area networks (WPANs), organized task group 3a to develop an alternative 
physical layer based on UWB signaling |2J. There were two main contenders for this standard: a frequency- 
hopping orthogonal frequency-division multiplexing (OFDM) proposal known as Multiband OFDM and a 
code-division multiple access (CDMA) based technique. 

In this paper, we consider the proposed Multiband OFDM standard [3] (also recently standardized 
by the ECMA [3|). Multiband OFDM is a conventional OFDM system [5J combined with bit-interleaved 
coded modulation (BICM) [6J for error prevention and frequency hopping for multiple access and improved 
diversity. The signal bandwidth is 528 MHz, which makes it a UWB signal according to the definition of the 
US Federal Communications Commission (FCC) [T], and hopping between three adjacent frequency bands 
is employed for first generation devices 0. Thus, the Multiband OFDM proposal is a rather pragmatic 
approach for UWB transmission, which builds upon the proven BICM-OFDM concept. 1 

The objective of this paper is to study the suitability and to analyze the (potential) performance of 
Multiband OFDM for UWB transmission. Furthermore, we propose system performance enhancements 
by applying capacity-approaching Turbo and Repeat-Accumulate (RA) codes and by using OFDM bit- 
loading. These specific techniques were chosen because of their potential for improved system performance 
without requiring substantial changes to other portions of the Multiband OFDM system, nor requiring 
major increases in complexity. Since our investigations rely on the new UWB channel model developed 
under IEEE 802.15 [7J, we first analyze this channel model in the frequency domain and extract the relevant 
statistical parameters that affect the performance of OFDM based transmission. In particular, the amount of 
diversity available in the wireless channel as a function of the signal bandwidth is examined. As appropriate 
performance measures for coded communication systems, we discuss the capacity and cutoff rate limits 
of BICM-OFDM systems for UWB channels. In this context, since one limiting factor of performance in 

practical and especially in wideband BICM-OFDM systems is the availability of high-quality channel state 
throughout this paper, the term "Multiband OFDM" refers to the particular standard proposal [3], whereas "BICM- 
OFDM" refers to the general concept of combining BICM and OFDM. 
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estimates, the effect of imperfect channel state information (CSI) at the receiver is specifically addressed. 
Furthermore, the information-theoretic performance limits are compared with simulated bit-error rate (BER) 
results for the Multiband OFDM proposal and the extensions introduced herein. 

The literature on Multiband OFDM systems and their performance is relatively sparse. In |S] the 
authors present an overview of the Multiband OFDM system as well as performance results. The authors 
of P] consider properties of the UWB channel related to Multiband OFDM, and independently arrive at 
results agreeing with our findings in Section 13. II 2 As an extension to the standard proposal, simplified 
Low-Density Parity-Check (LDPC) codes are considered in [TU] in order to improve the power efficiency of 
the Multiband OFDM system for a subset of the proposed data rates. The authors of (TTJ consider the 
application of a clustered power allocation scheme to Multiband OFDM. However, this scheme attempts 
to maximize throughput and therefore does not provide fixed data rates compatible with the Multiband 
OFDM standard proposal. In [12J the authors present a space-time-frequency coding scheme for Multiband 
OFDM. A subband and power allocation strategy for a multiuser Multiband OFDM system is given in [13J, 
but each user in the system uses a fixed modulation (i.e. no per-user bit allocation is performed). We note 
that none of these previous works provide comparisons with relevant information-theoretic limits. 

The remainder of this paper is organized as follows. Section |2] describes the Multiband OFDM system 
and the performance enhancements we propose, as well as the UWB channel model under consideration. 
The properties of the UWB channel relevant for OFDM transmission are examined in Section El Sectional 
presents the capacity and cutoff rate analysis and numerical results. Simulation results for the Multiband 
OFDM system and the proposed extensions are presented and compared with the theoretical benchmark 
measures in Sectional and conclusions are given in Section EJ 

2 Multiband OFDM, Extensions, and UWB Channel Model 

In this section, the transmission system and channel model are introduced. We describe the transmitter of 

the proposed Multiband OFDM standard |3j as well as extensions to channel coding and to modulation. 

For the receiver we adopt a conventional state-of-the art architecture including channel estimation based 
2 We note that our conference paper (presented at ICC 2005) was submitted before [9_ appeared. 
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on pilot symbols. 

2.1 Transmitter 

The block diagram of the Multiband OFDM transmitter is shown in Fig [T^ ) . A total of ten data rates 
(from 53.3 Mbps to 480 Mbps) are supported by the use of different code puncturing patterns as well as 
time and/or frequency repetition. We present a description of the Multiband OFDM standard proposal |3J 
in parallel with our extensions to channel coding and to modulation. 

2.1.1 Channel Coding and Spreading 

A. Multiband OFDM Standard Proposal: Channel coding in the proposed standard consists of classical 
BICM |6J with a punctured maximum free distance rate R = 1/3 constraint length 7 convolutional encoder. 
A multi-stage block-based channel interleaver is used (see [3] for details). After modulation (described 
below), modulated symbols are optionally repeated in time (in two consecutive OFDM symbols) and/or 
frequency (two tones within the same OFDM symbol), reducing the effective code rate by a factor of 2 or 
4 and providing an additional spreading gain for low data rate modes. The channel interleaver length (300, 
600 or 1200 coded bits) depends on the spreading factor. 

B. Extension — Turbo Codes: We propose the use of Turbo codes [13] in order to improve the system 
power efficiency and more closely approach the channel capacity. We examined generator polynomials of 
constraint length 3, 4 and 5 as well as various interleavers (including s-rand [JH] and dithered relative prime 
[To] designs). Due to their excellent performance for the code lengths considered as well as reasonable inter- 
leaver memory storage requirements, we decided to adopt the generator polynomials and interleaver design 
developed by the 3rd Generation Partnership Project (3GPP) [T7]. For low data rates, the time/frequency 
spreading technique of the Multiband OFDM proposal is retained. We would like to maintain compatibility 
with the Multiband OFDM channel interleaver by having each coded block fit into one channel interleaver 
frame, as is done with the convolutional codes used in the standard. 3 However, to maintain compatibility 
at the lowest data rates would require a Turbo code interleaver length of only 150 or 300 bits. Due to the 

poor distance properties and resultant performance degradation associated with short-length Turbo codes, 
3 Note that keeping the block lengths short also reduces the memory requirements and decoding delay at the receiver. 
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at low data rates we consider both Multiband OFDM-compliant block lengths and longer blocks of 600 
input bits (the same length as used without spreading). 

C. Extension — RA Codes: The limited length of the Multiband OFDM channel interleaver motivates 
the consideration of serially-concatenated codes, where the interleaver is positioned between the constituent 
encoders and thus has a longer length. We consider nonsystematic regular RA codes [TH] due to their 
simplicity and good performance for the required code lengths. The time/frequency spreading mechanism 
described above is discarded, and low-rate RA codes (R = 1/4 or 1/8) are used. The interleaver between 
the repeater and accumulator is randomly generated (no attempt is made to optimize its performance). 

2.1.2 Modulation 

A. Multiband OFDM Standard Proposal: In the proposed standard, the interleaved coded bits are 
mapped to quaternary phase-shift keying (QPSK) symbols using Gray labeling. After the optional spreading 
described above, groups of 100 data symbols are used to form OFDM symbols with N = 128 tones. 

B. Extension — Bit-Loading: The UWB channel (see Section |2~2"|) is considered time-invariant for the 
duration of many packet transmissions. For that reason, it is feasible to consider bit-loading algorithms to 
assign unequal numbers of bits to each OFDM subcarrier [H]. Channel state information is obtained at the 
transmitter, either by (a) exploiting channel reciprocity (if the same frequency band is used in the uplink 
and downlink as in the standard proposal), or (b) some form of feedback (which may be required even if 
the same frequency band is used, since reciprocity may not apply due to different interference scenarios for 
transmitter and receiver). We consider loading for higher data rates (without time or frequency spreading) 
using two different OFDM bit-loading schemes. We selected the algorithm of Piazzo [19| (which loads 
according to the uncoded BER) due to its low computational complexity, and the algorithm of Chow, Cioffi 
and Bingham (CCB) |20J because it loads according to the information-theoretic capacity criterion, as well 
as for its moderate computational complexity. 

The data rates and OFDM symbol structure of the Multiband OFDM proposal are maintained by 
loading each OFDM symbol with 200 bits. Each tone carries from to 6 bits using Quadrature Amplitude 
Modulation (QAM) signal constellations with Gray or quasi-Gray labeling (note that 6 bit/symbol corre- 
sponds to 64-QAM, which is a reasonable upper limit for modulation on a wireless channel). Due to FCC 
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restrictions on the transmitted power spectral density, power loading is not used (all tones carry the same 
power). The target uncoded BER for the Piazzo algorithm is chosen 1CT 5 (cf. jl9j for details), but we 
found that performance is quite insensitive to this parameter. For the CCB algorithm, the signal-to-noise 
ratio (SNR) gap parameter T is either 6 dB (when convolutional codes are used) or 3 dB (for Turbo codes). 
When the algorithm is unable to determine a suitable loading an all-QPSK loading is used, cf. [2U] for 
details. 

C. Extension — Clustered Bit-Loading: One potential feedback-based method of bit-loading is for 
the receiver to determine the appropriate modulation for each tone and feed the loading information back 
to the transmitter. To lower the feedback transmission requirements and significantly reduce the loading 
algorithm's computational complexity, we propose a clustered loading scheme where clusters are formed by 
considering groups of D adjacent tones. As we found the CCB algorithm superior over the Piazzo algorithm 
in terms of achievable power efficiency (see Sections l4.4.1l and 15. 2p . we make the following modification to 
the CCB algorithm. We substitute Eq. (1) of [2U] with: 

1 _ f ( -^~t~~> 'mar gin \ 

6 W = d E log 2 ( 1 + SNR ^ fc ) • ^~^ ns ~ ) 
k=i ^ 

where SNR(i,k) is the signal-to-noise ratio of the k th tone in the i th cluster, ^ marg in is the system 
performance margin in dB (iteratively calculated by the CCB algorithm), and b{i) is the (possibly non- 
integer) number of bits allocated for each tone in cluster i. Using the modified algorithm to load 200/D 
bits on 100/-D clusters provides the final integer-valued loadings b{i) for each cluster. Finally, all tones in 
cluster i are assigned b(i) bits (i.e. the loading inside each cluster is constant). This modification causes 
the CCB algorithm to load according to the mean capacity of all tones in a cluster. 

2.1.3 Framing and Transmission 

The time domain signal is generated via inverse fast Fourier transform (IFFT) and a cyclic prefix of 32 
samples is appended. The radio frequency (RF) transmit signal hops after each OFDM symbol between 
three 528 MHz frequency bands with center frequencies at 3.432, 3.960, and 4.448 GHz (see |3J for more 
details). Transmission is organized in packets of varying payload lengths. Each packet header contains two 
pilot OFDM symbols (all tones are pilots) per frequency band, which are used at the receiver to perform 
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channel estimation (see Section I2.3.ip . 

2.2 UWB Channel Model 

For a meaningful performance analysis of the Multiband OFDM proposal, we consider the channel model 
developed under IEEE 802.15 for UWB systems j7]. The channel impulse response is a Saleh-Valenzuela 
model |2T] modified to fit the properties of measured UWB channels. Multipath rays arrive in clusters 
with exponentially distributed cluster and ray interarrival times. Both clusters and rays have decay factors 
chosen to meet a given power decay profile. The ray amplitudes are modeled as lognormal random variables, 
and each cluster of rays also undergoes a lognormal fading. To provide a fair system comparison, the 
total multipath energy is normalized to unity. Finally, the entire impulse response undergoes an "outer" 
lognormal shadowing. The channel impulse response is assumed time invariant during the transmission 
period of several packets (see [7] for a detailed description). 

Four separate channel models (CM1-CM4) are available for UWB system modeling, each with arrival 
rates and decay factors chosen to match a different usage scenario. The four models are tuned to fit 0-4 m 
Line-of-Sight (LOS), 0-4 m non-LOS, 4-10 m non-LOS, and an "extreme non-LOS multipath channel", 
respectively. The means and standard deviations of the outer lognormal shadowing are the same for all 
four models. The model parameters can be found in fO Table 2]. 

2.3 Receiver 

The block diagram of the receiver considered in this paper is depicted in Fig [TJd). We assume perfect 
timing and frequency synchronization. For this paper (with the exception of Section 13. lj) . we consider 
only channels CM1-CM3, where the cyclic prefix is longer than the delay spread of the channel impulse 
response. 4 Thus, after FFT we see an equivalent iV dimensional frequency non-selective vector channel, 
expressed as [5J, 

Y[k) = X d [k]H + N[k] , (2) 

where the vector notation Z[k] = [Zi[k] . . . Z N [k]] T is used (- T denotes transpose) and -X" d [fc] is the 

N x N diagonal matrix with elements Xi[k] at its main diagonal. Y^k], Xi[k], and Ni[k] are the received 
4 However, we note that the CM4 performance is very similar to that of CM3. 
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symbol, the transmitted symbol, and the additive white Gaussian noise (AWGN) sample on frequency tone 
i — l...N of the /cth OFDM symbol, respectively. The vector H contains the frequency domain samples 
of the channel transfer function on tones i = 1 . . . N and is assumed constant over the considered time 
span (see Section I2"2j) . 

The channel estimation, diversity combining, demapping, and decoding are briefly described in the 
following. 

2.3.1 Channel Estimation 

We implement a least-squares error (LSE) channel estimator for the time-domain channel impulse response 
(CIR) using the P pilot OFDM symbols for each frequency band transmitted in the packet header. For 
a more general treatment, we let P be a design parameter, but we note that P = 2 is proposed in [3]. 
The responses in different frequency bands can be estimated separately, since pilot symbols are transmitted 
for each band. The LSE estimator is chosen instead of minimum-mean-square error (MMSE) estimation 
because it does not require assumptions regarding the statistical structure of the channel correlations. 
Furthermore, it has been shown that LSE and MMSE estimation perform almost equally well for cases of 
practical interest [22] . 

The LSE estimator exploits the fact that the CIR has a maximum of L < N taps. Starting from fl2J), 
the frequency-domain vector channel estimate can be represented as (cf. e.g. |22| ) 

H = H + E, (3) 

where the channel estimation error vector (-t denotes Hermitian transpose) 

i p 

E = F NxL F\ xL ■ - Xl[k]N[k] (4) 

k=l 

is independent of H and zero-mean Gaussian distributed with correlation matrix 

p 

F NxL F^ NxL o 2 N /P (5) 



Ree = F NxL F^ NxL [^Y, X] MXM)F NxL F\ 



NxL 



k=l 



In flU) and ©, F NxL denotes the normalized NxL FFT matrix with elements e~^ 2n/N /VN in row /i 
and column v, and <j 2 n is the AWGN variance. For the last step in flH]) we assumed the use of constant 
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modulus pilot symbols |Xj[A;]| = 1 (we note that in cases where bit-loading is applied, constant modulus 
symbols will still be used for the pilots in the packet header). We observe from (J3J) and (J^) that the LSE 
channel estimate is disturbed by correlated Gaussian noise with variance 

q L/ / \ 

°E = Jjp a N = V&N ■ (6) 

In order to keep complexity low we do not attempt to exploit the correlation, and we further assume that 
because of interleaving the effect of correlation is negligible. We will refer to parameter 77 = L/(NP) in 
(JIjJ) when evaluating the performance of Multiband OFDM with imperfect CSI in Sections 14.3.21 and 15.11 
In the remainder of this paper, we assume the maximum impulse response length (i.e., the length of the 
cyclic prefix) of L = 32. 

2.3.2 Diversity Combining, Demapping, and Decoding 

Maximum-ratio combining (MRC) [23J in the case of time and/or frequency spreading (see Section l2~T1 and 
P]) and demapping in the standard BICM fashion are performed based on the channel estimator output 
H. The resulting "soft" bit metrics are deinterleaved and depunctured. 

Convolutionally coded schemes use a soft-input Viterbi decoder to restore the original bit stream, 
requiring a decoding complexity of 64 trellis states searched per information bit. Turbo-coded schemes are 
decoded with 10 iterations of a conventional Turbo decoder using the log-domain BCJR algorithm [24J, 
with a complexity of roughly 10-2-2-8 = 320 trellis states searched per information bit (i.e. 10 iterations 
of two 8-state component codes, and assuming that the BCJR algorithm is roughly twice as complex as 
the Viterbi algorithm due to the forward- backward recursion). RA decoding is performed by a turbo-like 
iterative decoder, using a maximum of 60 iterations and an early-exit criterion which, at relevant values 
of SNR, reduces the average number of decoder iterations to less than ten [23] . We note that the per- 
iteration decoding complexity of the RA code is less than that of the Turbo code (since only a 2-state 
accumulator and a repetition code are used), making the total RA decoder complexity slightly more than 
the convolutional code but less than the Turbo code. The increased decoder complexities of the Turbo 
and RA codes, compared to the convolutional code, are reasonable considering the performance gains they 
provide (see Sectional). 
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3 UWB Channel and Diversity Analysis for Multiband OFDM 

The UWB channel model developed under IEEE 802.15 |7] (see Section |2~2"|) is a stochastic time-domain 
model. In this section, we consider a stochastic frequency-domain description, i.e., we include transmitter 
IFFT and receiver FFT into the channel definition and consider realizations of H in (J2J). In doing so, we 
intend to (a) extract the channel parameters relevant for the performance of OFDM-based UWB systems, 
(b) examine whether the Multiband OFDM proposal is adequate to exploit the channel characteristics, (c) 
quantify the impact of the different UWB channel types on system performance, and (d) possibly enable a 
classification of the UWB channel model into more standard channel models used in communication theory. 

From (J2J) we observe that the OFDM transmit signal experiences a frequency non-selective fading 
channel with fading along the frequency axis. Thereby, the outer lognormal shadowing term is irrelevant for 
the fading characteristics as it affects all tones equally. Hence, the lognormal shadowing term is omitted in 
the following considerations. Denoting the lognormal term by G, we obtain the corresponding normalized 
frequency-domain fading coefficients as 

Hf = Ht/G . (7) 

3.1 Marginal Distribution 

The first parameter of interest is the marginal distribution of Hf , i.e., the probability density function (pdf) 

p(Hf). 

First, we note that the frequency-domain coefficient Hf is a zero mean random variable since the 
time-domain multipath components are zero mean quantities. Furthermore, we have observed that Hf is, 
in good approximation, circularly symmetric complex Gaussian distributed, which is explained by the fact 
that Hf results from the superposition of many time-domain multipath components. Since these multipath 
components are mutually statistically independent, the variance of Hf is independent of the tone index i. 

Figure |21 shows measurements of the pdfs p(\Hf\) of the magnitude frequency-domain gain \Hf\ for 
the different channel models CM1-CM4, obtained from 10000 independent realizations of channel model. 
As can be seen, the experimental distributions agree well with the exact Rayleigh distribution of equal 
variance, which is in accordance with the statements above. We note that similar conclusions regarding 
the frequency-domain gains were obtained independently in [9]. 
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3.2 Correlation 

The findings in the previous section indicate that the OFDM signal effectively experiences a (classical) 
frequency non-selective Rayleigh fading channel (along the OFDM subcarriers). Therefore, knowledge of 
the second-order channel statistics, i.e., the correlation between different fading coefficients Hf and HJ, 
i 7^ j, is important for the design and assessment of diversity techniques such as coding, interleaving, and 
frequency hopping, which are envisioned in the Multiband OFDM system. Since coding is performed over 
all bands, we consider all 3 bands jointly. 

As an appropriate figure of merit we examine the ordered eigenvalues of the 3-N x 3-N correlation 
matrix RH n H n of H n = [Hf . . .H^. N ] T . Figure El shows the first 40 ordered eigenvalues (every second 
from 1st to 21st, and the 30th and 40th) of the measured RH n H n , which has been obtained from averaging 
over 1000 channel realizations, as a function of the total employed signal bandwidth. We only show results 
for channel models CM1 and CM3, which constitute the two extreme cases as the corresponding impulse 
responses have the least (CM1) and most (CM3) independent multipath components. The respective curve 
for model CM2 lies in between those for CM1 and CM3. 

From Figure Q] we infer the following conclusions: 

1. By increasing the bandwidth of the OFDM signal the diversity order of the equivalent frequency- 
domain channel, i.e., the number of the significant non-zero eigenvalues of RH n H n , is improved, 
since, generally, more time-domain multipath components are resolved. However, a 1500 MHz total 
bandwidth provides already > 40 (CM3) and > 30 (CM1) strong diversity branches. This indicates 
that the 528 MHz bandwidth and 3-band frequency hopping of Multiband OFDM is a favorable 
compromise between complexity and available diversity. 

2. Since the system, comprising the convolutional code (see Section 12. lj) with free distance < 15 
(depending on the puncturing) and spreading factor 1, 2, and 4, can at best exploit diversities of 
order 15, 30 and 60, respectively, bandwidths of more than 500 MHz per band would only be beneficial 
for the lowest data-rate modes, and then only for very low error rates. Similar considerations apply 
to concatenated codes (e.g. Turbo and RA codes as considered in this paper), which do not exceed 
convolutional codes with spreading in terms of free distance. 

3. Though CM3 provides higher diversity order than CM1, the latter appears advantageous for high 
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data-rate modes with code puncturing due to its larger first ordered eigenvalues. 
In summary, we conclude that, given a particular realization of the lognormal shadowing term, the equivalent 
frequency-domain channel H in fl^l is well approximated by a Rayleigh fading channel with relatively high 
"fading rate", which increases from CM1 to CM3. 



The purpose of this section is to quantify potential data rates and power efficiencies of OFDM-based 
UWB transmission. Of particular interest here are (a) the channel capacity and cutoff rate, 5 which are 
widely accepted performance measures for coded transmission using powerful concatenated codes and 
convolutional codes, respectively, (b) the influence of the particular channel model (CM1-CM3), and (c) 
the effect of imperfect channel estimation on these measures. Since coding and interleaving are limited 
to single realizations of lognormal shadowing, we focus on the notion of outage probability, i.e., the 
probability that the instantaneous capacity and cutoff rate for a given channel realization H fall below a 
certain threshold. These theoretical performance measures will be compared with simulation results for the 
Multiband OFDM system in Section El 

In Section PTT1 we review the instantaneous capacity and cutoff rate expressions for BICM-OFDM, and 
extend these expressions to include systems with bit-loading. The required conditional pdf of the channel 
output is given in Section 14.21 Sections l4.3l and l4~4l contain numerical results for systems without and with 
loading, respectively. 

4.1 Capacity and Cutoff Rate Expressions 
4.1.1 Without Bit-Loading 

The instantaneous capacity in bits per complex dimension of an N tone BICM-OFDM system in a frequency- 
selective quasi-static channel is given in [26J (by extending the results of |6J) as 



5 lt is important to note that the capacity and cutoff rate discussed here are constellation-constrained, i.e., they are 
calculated assuming a given input constellation with uniform input probabilities. 



4 Capacity and Cutoff Rate Analysis 




8 
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In (JSJ, m is the number of bits per symbol, X is the signal constellation and X£ is the set of all constellation 
points X G X whose label has the value b G {0, 1} in position £, p(Yi\Hi, X^) is the pdf of the channel 
output Yj for given input Xi and channel estimate H i: and E z {-} denotes expectation with respect to z. 
For Multiband OFDM, X is the QPSK signal constellation and m = 2 is valid. 

Similarly, we can express the instantaneous cutoff rate in bits per complex dimension as (cf. e.g. |q1I2"o]) 



MH)=m(l-log 2 (B(H) + l)) 
with the instantaneous Bhattacharya parameter (b denotes the complement of b) 



(9) 



1 m N 



1=1 i=l 



\ 



E p(Yi\Hi,Xi 
E p{Yi\Hi,*i 



(10) 



4.1.2 With Bit-Loading 

The instantaneous capacity in bits per complex dimension of an N tone BICM-OFDM system using loading 
can be found by extending Eqs. flE} and (0) (following the methodology of (SI EE]) as 



N mi 



( 



C{H) =m-i^£ E bXi I log 



i=l £=1 



E p(Xi\Hi,Xi 



\ 



E p{Yi\Hi,Xi) 



(ii) 



/ 



In (II ip . fh is the average number of bits/symbol (m = 2 throughout this paper), and Xi are the 
number of bits per symbol and the signal constellation for the i th tone, respectively, and Xf b is the set of 
all constellation points X G Xi whose label has the value b G {0, 1} in position L 

Similarly, we can express the instantaneous cutoff rate for bit-loading systems in bits per complex 
dimension as 

Ro(H) = m(l - log 2 (B(H) + 1)) (12) 
with the instantaneous Bhattacharya parameter 



N mi 



N * — ' *■ — ' rri j 

i=\ 1=1 1 



\ 



E p(Yi\Hi,Xi 



Xiexf 



E p(Yi\Hi,Xi 
Xtex* 



> . 



(13) 
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4.2 Conditional PDF 

In order to calculate capacity and cutoff rate, we require the conditional pdf p(Y i \H i , JQ). In the case of 
perfect CSI we have Hi = H it and p(Yi\Hi, X^) is a Gaussian pdf with mean HXi and variance o~ 2 N . 

We now obtain p(Yi\Hi, Xi) for the more realistic case of imperfect CSI assuming the application of 
LSE channel estimation as described in Section 12.31 According to the results of Section 13.11 and since 
channel estimation is performed for one realization G of the lognormal shadowing term, we further assume 
zero-mean circularly symmetric Gaussian distributed channel coefficients Hi with variance o 2 H = G 2 (see 
Eq. (J7])). This means that Hi is also zero-mean Gaussian distributed with variance cr 2 ^ = <j 2 H + o~ 2 E (see 
Eqs. (J3J) and (0)). Let \i be the correlation between Hi and Hi, 



= w/^i = = rx: (14) 

o-ho-h V °E + °h V 7 + V 

where (•)* denotes the complex conjugate, 77 is defined in (JSJ, and 7 = <J 2 H /o~ 2 N is the SNR. Then, we can 
arrive via algebraic manipulations at (cf. e.g. |27] ) 



p(Y i \H l ,X l )= 1 exp(- ^ YT-^rr \l 1 • ( 15 ) 

7T«(r//i 2 + l)) \ cr 2 N (r]fi 2 + V 



2' 



The Gaussian density of f|15j) implies that the system with imperfect CSI can be seen as a system with 
perfect CSI at an equivalent SNR of 

_ E^mi 2 }^ _ 7 
7e ^(^ 2 + i) ^(1 + 1) + 1 ' 1 } 

We note that in the high SNR regime the loss due to estimation error reaches a constant value of l/(r/ + 1). 

4.3 Numerical Results — No Loading 

We evaluated expressions flS} and d^J) via Monte Carlo simulation using 1000 realizations of each UWB 
channel model CM1-CM3. To keep the figures legible, we present representative results for CMl and CM3 
only. The performance of CM2 (not shown) is between that of CMl and CM3 (cf. also Section 13.21) . 
For comparison we also include results for independent and identically distributed (i.i.d.) Rayleigh fading 
on each tone and an outer lognormal shadowing term identical to that of the UWB models (labeled as 
"Rayleigh + LN"). 
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4.3.1 Perfect CSI 

First, we consider the case of perfect CSI. Figure El shows the outage capacity Pr{C(H) < R} (left) and 
cutoff rate Pr{i? (^) < R} (right) as a function of the threshold rate R for 101og 10 (^ s /jV ) = 5 dB 
and 10 dB, respectively, where E s is the average received energy per symbol and Mq denotes the two-sided 
power spectral density of the complex noise. 

It can be seen that both capacity and cutoff rate for the UWB channel models are similar to the 
respective parameters of an i.i.d. Rayleigh fading channel with additional lognormal shadowing. In fact, 
the curves for CM3, which provides the highest diversity (see Section are essentially identical to those 
for the idealized i.i.d. model. The high diversity provided by the UWB channel also results in relatively 
steep outage curves, which means that transmission reliability can be considerably improved by deliberately 
introducing coding redundancy. This effect is slightly more pronounced for the capacity measure relevant 
for more powerful coding. On the other hand, the effect of shadowing, which cannot be averaged out by 
coding, causes a flattening towards low outage probabilities < 0.1. In the high outage probability range 
we note that CM1 is slightly superior to CM3, which is due to the large dominant eigenvalues of CMl 
identified in Section I3~21 

In Figure EJwe consider the 10% outage 6 capacity and cutoff rate as a function of the SNR 
10 log 10 (£'< j /Ao). Again we note the close similarity between the UWB channel models and the i.i.d. 
Rayleigh fading channel with lognormal shadowing. A comparison of the capacity with the corresponding 
cutoff rate curves indicates that decent gains of 2.5 dB to 3 dB in power efficiency can be anticipated 
by the application of more powerful capacity approaching codes such as Turbo or RA codes (see also the 
simulation results in Section IS~Tj) instead of the convolutional codes proposed in [3] which usually perform 
in the vicinity of the cutoff rate. 

4.3.2 Imperfect CSI 

Figure E] shows the SNR loss due to LSE channel estimation according to Eq. ([To]) with various values of 

r\. For reference, the Multiband OFDM system uses P = 2, N = 128, and so choosing L = 32 (equal to 

the cyclic prefix length) leads to rj = 0.125. 

6 We note that 10% outage is a typically chosen value for UWB systems and the considered channel model 
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We can see from Figure |5]that the performance penalty 101og 10 (7/7 e ) due to imperfect CSI is about 
0.5 dB in the range of interest for the Multiband OFDM system. The actual loss in E s /J\f is slightly 
different, since 7 in Figure El is for a fixed lognormal shadowing and the actual E s /J\f loss must be 
obtained by averaging over the lognormal pdf. However, we can see from Figure El that the SNR loss 
is relatively constant for relevant values of 7, which (since the lognormal shadowing has a dB mean), 
results in E s /J\f loss of approximately 10 log 10 (7/7 e ). Reducing the channel estimation overhead to P = 1 
(?7 = 0.25) could be an interesting alternative for short packets, as the additional loss is only about 0.5 dB 
(in terms of required energy per information bit Eb the loss is even smaller). Further reduction of pilot 
tones is not advisable as the gains in throughput are outweighed by the losses in power efficiency. 

4.4 Numerical Results with Bit-Loading 

In this section, we examine the capacity and cutoff rate of systems employing the Piazzo and CCB loading 
algorithms. We evaluated expressions (II ip and (1120 via Monte Carlo simulation as discussed in Section 1331 

4.4.1 No Clustering 

Figure El (lines) shows the 10% outage capacity and cutoff rates for the CM1 channel using the Piazzo and 
CCB loading algorithms. (The markers in this figure will be discussed in Section I5~2"]) . It should be noted 
that E s is not adjusted to account for tones carrying bits, because we assume operation at FCC transmit 
power limits, precluding the re-allocation of power from unused tones to other subcarriers (which would 
put the transmit power spectral density beyond the allowed limits). We also do not adjust for the overhead 
associated with the feedback of loading information from the receiver to the transmitter. For high rates, 
both the CCB and the Piazzo loading algorithms provide a gain of several dB in capacity and in cutoff rate 
compared to the unloaded case, and this gain grows with increasing rate and E s /J\f . The Piazzo algorithm 
is sub-optimal because it considers only the relative SNR between tones, and loads according to BER using 
a power minimization criterion. This loading strategy is not guaranteed to produce an increased channel 
capacity (or cutoff rate). On the other hand, the CCB algorithm requires knowledge of the actual SNR 
values of each tone and loads according to their approximate capacities, resulting in an increased channel 
capacity for all SNR values and an improved performance compared to Piazzo loading. 
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4.4.2 Clustering 

We next consider the application of clustered loading using the modified CCB algorithm as described in 
Section 12.1.21 Figure |H] shows the 10% outage capacity (solid lines) and cutoff rate (dashed lines) for 
various values of cluster size D, for channels CMl and CM3. Also included for comparison are the non- 
clustered loading (D = l) and unloaded (all-QPSK) curves. As the cluster size D increases the attainable 
rates decrease because the modulation scheme chosen for each cluster is not optimal for all tones in the 
cluster. This loss is slightly more pronounced for the cutoff rate than for the capacity, which indicates that 
when using clustered loading we should expect more performance degradation with convolutional codes 
than with Turbo codes (see also Section l5~2"j) . The performance degradation with increasing cluster size is 
higher for CM3 than for CMl, which can be predicted from the correlation matrix results of Section I3~2l 
Specifically, we note from Figure QJthat the frequency responses of adjacent subcarriers are more correlated 
(fewer significant eigenvalues) in CMl and less correlated (more significant eigenvalues) in CM3. The less 
correlated the tones of a cluster are, the higher the average mismatch between the optimal modulation for 
each tone (i.e. that chosen by the non-clustered loading algorithm) and the fixed modulation chosen for 
the cluster. The higher average mismatch on CM3 results in lower performance when clustered loading is 
applied. 

5 Simulation Results 

In Section 15.11 we study Turbo, RA, and convolutional coding without bit-loading. We examine channel 
CMl with four different transmission modes with data rates of 80, 160, 320, and 480 Mbps corresponding to 
0.25, 0.50, 1.00, and 1.50 bit/symbol, respectively. In the simulations, detection is performed with perfect 
CSI as well as with LSE channel estimation using 77 = 0.125. We then turn to the performance of systems 
with loading in Section l5~2l Based on the results of the information-theoretic analysis of Section 14.41 we 
restrict our attention to rates > 1.00 bit/symbol, where we expect loading algorithms to yield performance 
gains. We concentrate on Turbo and convolutional codes for this section. The simulation results presented 
in these two sections are the worst-case 10 log w (E s /J\f ) values required to achieve BER<10 -5 for the 
best 90% of channel realizations over a set of 100 channels (i.e. they are simulation results corresponding 
to 10% outage). 
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In Section 15.31 we briefly summarize the power efficiency gains and attendant range improvements 
expected from the application of the system extensions we have proposed. 

5.1 No Loading 

Figure El (markers) shows SNR points when using convolutional codes (as in the Multiband OFDM proposed 
system), together with the corresponding 10% outage cutoff rate curves. We observe that the simulated 
SNR points are approximately 3 dB to 4 dB from the cutoff-rate curves, which is reasonable for the channel 
model and coding schemes under consideration. These results (a) justify the relevance of the information- 
theoretic measure and (b) confirm the coding approach used in Multiband OFDM. More specifically, the 
diversity provided by the UWB channel is effectively exploited by the chosen convolutional coding and 
interleaving scheme. Furthermore, the system with LSE channel estimation performs within 0.5-0.7 dB of 
the perfect CSI case as was expected from the cutoff-rate analysis (see also the discussion in Section I?. 3. 21 
on the relationship between the loss 10 log 10 (7/7 e ) and the 101og 10 (£' s /A/o) loss). 

We next consider the Turbo and RA coding schemes. Figure [TU] (markers) shows the simulation results 
for Turbo and RA codes on channel CM1 with perfect CSI, as well as the convolutional code results 
for comparison. We also show the corresponding 10% outage capacity and cutoff rate curves. Turbo 
codes give a performance gain of up to 5 dB over convolutional codes, and perform within 2.5 dB of the 
channel capacity, depending on the rate. At rates of 0.25 and 0.50 bit/symbol, Turbo code interleaver 
sizes compatible with the channel interleaver design of the Multiband OFDM proposal (the "std" points) 
incur a performance penalty of 1-2 dB compared with the longer block length ("K=600") points. Repeat- 
accumulate codes have a performance roughly 1 dB worse than the long block-length Turbo codes, but the 
RA codes are both (a) compatible with the Multiband OFDM channel interleaver, and (b) less complex to 
decode. They are thus a good candidate for low-rate Multiband OFDM transmission. 

5.2 With Loading 

Figure|7](markers) shows the simulation results for Turbo codes and for convolutional codes, using both the 
CCB and Piazzo loading algorithms on channel CM1 with perfect CSI. At 1.00 bit/symbol and using con- 
volutional codes, we see a performance gain of less than 1 dB using CCB loading, and a slight performance 
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degradation using Piazzo loading. Performance using Turbo codes at 1.00 bit/symbol is relatively constant 
regardless of loading. However, at 1.50 bit/symbol we see gains of approximately 1.5 dB for Turbo codes 
and almost 4 dB for convolutional codes when CCB loading is used. The gains using the Piazzo algorithm 
are approximately 1 dB less, as predicted by the capacity analysis of Section I4~4l Finally, we note that at 
1.50 bit/symbol the system employing CCB loading and Turbo codes is approximately 6 dB better than the 
unloaded convolutionally coded system, and performs within approximately 2.5 dB of the channel capacity. 

In Figure |S] (markers) we consider the performance of clustered loading with Turbo codes and with 
convolutional codes for 1.50 bit/symbol on the CMl and CM3 channels with perfect CSI. As predicted by 
information-theoretic analysis, clustered loading incurs a performance penalty with increasing cluster size D. 
We note that Turbo codes suffer a smaller performance degradation (relative to D = l) than convolutional 
codes, because the more powerful Turbo code is better suited to handle the mismatched modulation (as 
discussed in Section I4.4.2|) . The performance degradation is larger for CM3 due to that channel model's 
lower correlation between adjacent subcarrier frequency responses and resultant larger loading mismatch. 
However even D = 10 loading provides performance gains for both channels and code types. Cluster size 
D = 2 is a good tradeoff point for both Turbo and convolutional codes, allowing for feedback reduction by 
a factor of 2 with losses of approximately 0.1 dB for CMl and 0.4 dB for CM3. Cluster sizes as large as 
D — 5 could be used with Turbo codes, depending on the required power efficiency and expected channel 
conditions. 

5.3 Range Improvements from Turbo Codes and Loading 

Table [TJ lists the gains in required 10 log 10 (E s /J\f ) and percentage range increases on channel CMl for 
various combinations of the extensions we have proposed. We assume a path loss exponent of d = 2, as in 
[8]. We can see that bit loading alone provides up to 47% increase in range, Turbo codes without loading 
provide a 71% increase, and the combination of Turbo codes and loading allows for a 106% increase in 
range. Furthermore, the use of clustered loading with D = 2 only reduces these range improvements by 
1% to 3% over the non-clustered case, while providing reduced-rate feedback and lower computational 
complexity. 
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6 Conclusions 

In this paper, the application of Multiband OFDM for UWB communication has been analyzed. We 
have shown that the UWB channel model developed under IEEE 802.15 is seen by OFDM systems in the 
frequency domain as Rayleigh fading with additional shadowing. The 528 MHz signal bandwidth chosen 
for Multiband OFDM essentially captures the diversity provided by the UWB channel. As a result, we 
have found that the information-theoretic limits of the UWB channel are similar to those of a perfectly 
interleaved Rayleigh fading channel with shadowing. The BICM-OFDM scheme proposed for Multiband 
OFDM performs close to the outage cutoff-rate measure and is thus well suited to exploit the available 
diversity. The application of stronger coding, such as Turbo codes or Repeat-Accumulate codes, improves 
power efficiency by up to 5 dB, depending on the data rate. Bit-loading algorithms provide additional 
performance gains for high data rates, and a simple clustering scheme allows for reduced-rate feedback of 
loading information depending on the channel conditions and required power efficiency. Finally, a simple 
LSE channel estimator has been shown to enable performance within 0.5-0.7 dB of the perfect CSI case 
for the proposed Multiband OFDM system. 
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Figure 1: Block diagram of Multiband OFDM transmission system. 
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Figure 2: Distributions of normalized channel magnitude \Hf \ for channel types CM1-CM4. For compar- 
ison: Rayleigh distribution with same variance. 
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Figure 3: First 40 ordered eigenvalues of the correlation matrix Rh' h 11 ( ever Y second from 1st to 21st, 
and the 30th and 40th). 
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Figure 4: Outage probability for 101og 10 (i? s /Ao) = 5 dB and 10 dB and perfect CSI. Left: Outage 
capacity. Right: Outage cutoff rate. 
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Figure 5: 10% outage capacity and cutoff rate for perfect CSI. 
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Figure 7: 10% outage capacity and cutoff rate with and without loading for CM1 (lines). 10 \og 10 (E s /J\f ) 
required to achieve BER < 10~ 5 for the 90% best channel realizations using convolutional and Turbo codes, 
with and without loading (markers). 
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101og 10 (^ s /A/" ) [dB] required for 90% BER < 10~ 5 



Figure 8: Lines: 10% outage capacity (solid) and cutoff rate (dashed) for clustered CCB loading (cluster 
sizes D e {1,2,5,10}) and for non-loaded QPSK ("NIL"). Markers: 101og 10 (^ s /jV ) required to achieve 
BER < 10~ 5 for the 90% best channel realizations using Turbo codes (□ markers) and convolutional codes 
(x markers). Channels CMl (top) and CM3 (bottom). 
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Figure 9: 101og 10 (£' s /Ao) required to achieve BER < 1CT 5 for the 90% best channel realizations using 
convolutional codes (markers). For comparison: 10% outage cutoff rate (lines). Channel model CMl and 
LSE channel estimation. 
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Figure 10: 101og 10 (£ , s /A/"o) required to achieve BER < 10~ 5 for the 90% best channel realizations using 
Turbo Codes, RA codes, and convolutional codes (markers). For comparison: 10% outage capacity and 
cutoff rate (lines). Channel model CMl and perfect CSI. 
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Table 1: Power efficiency gains and range increases available using some of the extensions considered, 
compared to the Multiband OFDM standard proposal. Channel CM1, rate 1.50 bit/symbol (480 Mbps), 
path loss exponent d = 2. 10 \og 10 (E s /J\f ) values are those required to achieve BER < 10~ 5 for the 90% 
best channel realizations. (CC: convolutional code, TC: Turbo code). 
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